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A NOVEL GROWTH FACTOR AND A GENETIC SEQUENCE 
ENCODING SAME 

5 The present invention relates generally to an isolated molecule having vascular 
endothelial growth factor-like properties and to a genetic sequence encoding same. The 
molecule will be useful in the development of a range of therapeutics and diagnostics 
useful in the treatment, prophylaxis and/or diagnosis of conditions requiring enhanced 
or diminished vasculature and/or vascular permeability. The molecule of the present 
10 invention is also a useful effector of primary and central neurons and is capable of 
inducing astroglial proliferation. 

Bibliographic details of the publications referred to by author in this specification are 
collected at the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for 
15 the nucleotide and amino acid sequences referred to in the specification are defined 
following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises" or "comprising", will be understood to 
20 imply the inclusion of a stated element or integer or group of elements or integers but 
not the exclusion of any other element or integer or group of elements or integers. 

Vascular endothelial growth factor (hereinafter referred to as "VEGF"), also known as 
vasoactive permeability factor, is a secreted, covalently linked homodimeric glycoprotein 

25 that specifically activates endothelial tissues (Senger et al, 1993). A range of functions 
have been attributed to VEGF such as its involvement in normal angiogensis including 
formation of the corpus luteum (Yan et al, 1993) and placental development (Sharkey 
et al, 1993), regulation of vascular permeability (Senger et al, 1993), inflammatory 
angiogenesis (Sunderkotter et al, 1994) and autotransplantation (Dissen et al, 1994) and 

30 human diseases such as tumour promoting angiogenesis (Folkman & Shing, 1992), 
rheumatoid arthritis (Koch et al, 1994) and diabetes related retinopathy (Folkman & 
Shing, 1992). 
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VEGF is, therefore, an important molecule making it a potentially valuable target for 
research into therapeutics, prophylactics and diagnostic agents based on VEGF or its 
activities. There is also a need to identify homologues or otherwise related molecules 
for use as an alternative to VEGF or in conjunction with VEGF. 

5 

In work leading up to the present invention, the inventors sought the multiple endocrine 
neoplasia type I susceptibility gene (MEN1). Surprisingly, the inventors discovered that 
a genetic sequence excluded as a candidate for the MEN1 gene was nevertheless a new 
growth factor having some similarity to VEGF. Furthermore, the growth factor of the 
0 present invention is an effector molecule for primary and central neurons. 

Accordingly, one aspect of the present invention comprises a biologically isolated 
proteinaceous molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID 
NO:2; and 

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 

Another aspect of the present invention provides a biologically isolated proteinaceous 
molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to all or part of the amino acid sequence set forth 
in SEQ ID NO:2; 

(ii) exhibits at least one property in common with VEGF. 

A related aspect of the present invention contemplates a biologically isolated 
proteinaceous molecule having the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one of the following properties: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) ability to interact with flt-\lflk-\ family of receptors; 
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(c) ability to induce cell migration, cell survival and/or an increase in 
intracellular levels of alkaline phosphatase. 

By "biologically isolated" is meant that the molecule has undergone at least one step of 
5 purification from a biological source. Preferably, the molecule is also biologically pure 
meaning that a composition comprises at least about 20%, more preferably at least about 
40%, still more preferably at least about 65%, even still more preferably at least about 
80-90% or greater of the molecule as determined by weight, activity or other convenient 
means, relative to other compounds in the composition. Most preferably, the molecule 
10 is sequencably pure. 

Another preferred aspect of the present invention provides the molecule in recombinant 
form. 

15 According to this aspect of the present invention, there is provided a recombinant 
molecule comprising a sequence of amino acids which: 

(i) is at least about 15% similar to the amino acid sequence set forth in SEQ ID 
NO:2; and 

(ii) is at least 5% dissimilar to the amino acid sequence set forth in SEQ ID NO:2. 

20 

A related aspect of the present invention is directed to a recombinant molecule having 
the following characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity but at 
least about 5% dissimilarity to all or part of the amino acid sequence set forth 

25 in SEQ ID NO:2; 

(ii) exhibits at least one property in common with VEGF. 

A further related aspect of the present invention contemplates a recombinant molecule 
having the following characteristics: 
30 (i) comprises an amino acid sequence having at least about 15% similarity but at 

least about 5% dissimilarity to the amino acid sequence set forth in SEQ ID 

NO:2; 
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(ii) exhibits at least one of the following properties: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) ability to interact with flt-Vflk-l family of receptors; 

(c) ability to induce cell migration, cell survival and/or an increase in 
5 intracellular levels of alkaline phosphatase. 

The present invention also contemplates genomic or partial genome clones encoding a 
proteinaceous molecule having at least about 15% amino acid similarity but at least 
about 5% dissimilarity to SEQ ID NO:l. 

10 

The amino acid sequence set forth in SEQ ID NO:2 corresponds to human VEGF 
(referred to herein as "VEGF I65 "). Accordingly, the molecule of the present invention 
is VEGF-like or is a homologue of VEGF but comprises an amino acid sequence which 
is similar but non-identical to the amino sequence of VEGF. Although the present 
15 invention is exemplified using a human VEGF-like molecule, this is done with the 
understanding that the instant invention contemplates the homologous molecule and 
encoding sequence from other mammals such as livestock animals (e.g. sheep, pigs, 
horses and cows), companion animals (e.g. dogs and cats) and laboratory test animals 
(e.g. mice, rats, rabbits and guinea pigs) as well as non-mammals such as birds (e.g. 
20 poultry birds), fish and reptiles. In a most preferred embodiment, the VEGF-like 
molecule is of human origin and encoded by a gene located at chromosome 1 lql3. The 
present invention extends, therefore, to the genomic sequence or part thereof encoding 
the subject VEGF-like molecule. 

25 Preferably, the percentage similarity is at least about 30%, more preferably at least about 
40%, still more preferably at least about 50%, still even more preferably at least about 
60-70%, yet even more preferably at least about 80-95% to all or part of the amino acid 
sequence set forth in SEQ ID NO:2. ' 



30 



In a particularly preferred embodiment, the VEGF-like molecule of the present invention 
comprises a sequence of amino acids as; set forth in SEQ ID NO:4 or is a part, fragment, 
derivative or analogue thereof. Particularly preferred similarities include about 19-20%, 
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and 29-30%. Reference herein to derivatives also includes splice variants. Accordingly, 
the present invention extends to splice variants of SOM1 75. Examples of splice variants 
contemplated by the present invention include but are not limited to variants with an 
amino acid sequence substantially as set forth in at least one of SEQ ID NO:6, SEQ ID 
5 NO:8 and/or SEQ ID NO: 1 0 or mutants or derivatives or further splice variants thereof. 

Another embodiment provides a recombinant molecule having the following 
characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:4 or having at 
10 least about 15% similarity to all or part thereof provided that said amino acid 

sequence is at least about 5% dissimilar to all or part of the amino acid sequence 
set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

15 Another embodiment provides a recombinant molecule having the following 
characteristics: 

(i) an amino acid sequence substantially as set forth in SEQ ID NO:6 or having at 
least about 1 5% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acid sequence 

20 set forth in SEQ ID NO:2; 

(ii) exhibits at least one biological property in common with VEGF. 

Another embodiment provides a recombinant molecule having the following 
characteristics: 

25 (i) an amino acid sequence substantially as set forth in SEQ ID NO:8 or having at 
least about 1 5% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acid sequence 
set forth in SEQ ID NO:2; 
(ii) exhibits at least one biological property in common with VEGF. 



Another embodiment provides a recombinant molecule having the following 
characteristics: 
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(i) an amino acid sequence substantially as set forth in SEQ ID NO: 10 or having at 
least about 1 5% similarity to all or part thereof provided that said amino acid 
sequence is at least about 5% dissimilar to all or part of the amino acM sequence 
set forth in SEQ ID NO:2; 
5 (ii) exhibits at least one biological property in common with VEGF. 

Such properties of VEGF include at least one of: 

(a) ability to induce proliferation of vascular endothelial cells; 

(b) an ability to interact with flt-\/flk-\ family of receptors; 

1 0 (c) an ability to induce cell migration, cell survival and/or an increase in intracellular 
levels of alkaline phosphatase. 

In accordance with the present invention, a preferred similarity is at least about 40%, 
more preferably at least about 50% and even more preferably at least about 65% 
15 similarity. 

Still a further aspect of the present invention contemplates a peptide fragment 
corresponding to a portion of the amino acid sequence set forth in SEQ ID NO:4 or a 
splice variant thereof such as set forth in SEQ ID NO:6, SEQ ID NO: 8 or SEQ ID 

20 NO: 10 or a chemical equivalent thereof. The biologically isolated or recombinant 
molecule of the present invention may be naturally glycosylated or may comprise an 
altered glycosylation pattern depending on the cells from which it is isolated or 
synthesised. For example, if produced by recombinant means in prokaryotic organisms, 
the molecule would be non-glycosylated. The molecule may be a full length, naturally 

25 occurring form or may be a truncated or otherwise derivatised form. 

Yet another aspect of the present invention is directed to a nucleic acid molecule 
encoding the VEGF-like molecule herein described. More particularly, the present 
invention provides a nucleic acid molecule comprising a sequence of nucleotides 
30 substantially as set forth in SEQ ID NO:3 or having at least 15% similarity to all or part 
thereof or being capable of hybridising under low stringency conditions to a reverse 
complement of the nucleotide sequence as set forth in SEQ ID NO:3 provided that the 
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nucleic acid sequence having at least 15% similarity but at least 30% dissimilarity to the 
nucleotide sequence as set forth in SEQ ID NO:3. The nucleotide sequence set forth in 
SEQ ID NO:3 is also referred to herein as "SOM175". Preferably, the percentage 
dissimilarity is about 35%, more preferably about 39% and even more preferably about 
40-50% or greater. 

For the purposes of defining the level of stringency, reference can conveniently be made 
to Sambrook et al (1989) at pages 9.47-9.51 which is herein incorporated by reference 
where the washing steps disclosed are considered high stringency. A low stringency is 
defined herein as being in 4-6X SSC/0. 1-0.5% w/v SDS at 37-45°C for 2-3 hours. 
Depending on the source and concentration of nucleic acid involved in the hybridisation, 
alternative conditions of stringency may be employed such as medium stringent 
conditions which are considered herein to be 1-4X SSC/0.25-0.5% w/v SDS at > 45°C 
for 2-3 hours or high stringent conditions considered herein to be 0.1-1 X SSC/0. 1% w/v 
SDS at 60°C for 1-3 hours. 

The present invention further contemplates a nucleic acid molecule which encodes a 
VEGF-like molecule as hereinbefore described having at least 1 5% nucleotide sequence 
homology to SEQ ID NO:3. Preferred levels of homology include at least about 40%, 
more preferably around 60-70%. 

The present invention is further directed to the murine homologue of human VEGF 
(referred to herein as "mVRF"). The mVRF has approximately 85% identity and 92% 
conservation of amino acid residues over the entire coding region compared to human 
VEGF. The mVRF is encoded by a nucleic acid molecule comprising a nucleotide 
sequence substantially as set forth in Figure 9. 

The VEGF-like molecule of the present invention will be useful in the development of 
a range of therapeutic and/or diagnostic applications alone or in combination with other 
molecules such as VEGF. The present invention extends, therefore, to pharmaceutical 
compositions comprising the VEGF-like molecule or parts, fragments, derivatives, 
homologues or analogues thereof together with one or more pharmaceutically acceptable 
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carriers and/or diluents. Furthermore, the present invention extends to vectors 
comprising the nucleic acid sequence set forth in SEQ ID NO:3 or having at least about 
15%, more preferably about 40% and even more preferably around 60-70% similarity 
thereto but at least 30% and more preferably around 39% dissimilarity thereto and host 
cells comprising same. In addition, the present invention extends to ribozymes and 
antisense molecules based on SEQ ID NO:3 as well as neutralizing antibodies to the 
VEGF-like molecule. Such molecules may be useful in ameliorating the effects of, for 
example, over expression of VEGF-ltke genes leading to angiogenesis or vascularization 
of tumours. 

Another aspect of the present invention contemplates a method of inducing astroglial 
proliferation in a mammal, said method comprising administering to said mammal an 
effective amount of a recombinant proteinaceous molecule having the characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 

said administration being for a time and under conditions sufficient to induce astroglial 
proliferation. 

Preferably, the recombinant proteinaceous molecule comprises the amino acid sequence 
set forth in SEQ ID NO:3 or SEQ ID NO:6. 

A further aspect of the present invention provides a method of promoting neural survival 
and/or proliferation in a mammal, said method comprising administering to said mammal 
an effective amount of a recombinant proteinaceous molecule having the characteristics: 

(i) comprises an amino acid sequence having at least about 15% similarity 
but at least about 5% dissimilarity to the sequence set forth in SEQ ID 
NO:2; 

(ii) exhibits at least one property in common with vascular endothelial growth 
factor (VEGF), 
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said administration being for a time and under conditions sufficient to induce astroglial 
proliferation. 

Preferably, the recombinant proteinaceous molecule comprises the amino acid sequence 
5 set forth in SEQ ID NO:3 or SEQ ID NO:6. 

The present invention also contemplates antibodies to the VEGF-like molecule or nucleic 
acid probes to a gene encoding the VEGF-like molecule which are useful as diagnostic 
agents. 

10 

The present invention is further described by reference to the following non-limiting 
Figures and/or Examples. 

In the Figures: 

15 

Figure 1 Nucleotide sequence [SEQ ID NO:l] and corresponding amino acid 
sequence [SEQ ID NO:2] of VEGF, 65 . 

Figure 2 Nucleotide sequence [SEQ ID NO:3] and corresponding amino acid 
20 sequence [SEQ ID NO:4] of SOM175. 

Figure 3 Results of BLAST search with SOM175 protein sequence. 

Figure 4 BESTFIT alignment of VEGF cDNA and SOM175 cDNA. 

25 

Figure 5 Multiple alignment of VEGF I65 with SOM175 and its splice variants at 
the nucleotide level. 

Figure 6 Multiple alignment of VEGF 165 with SOM175 and its splice variants at 
30 the amino acid level. 



Figure 7 



Diagrammatic representation of SOM175 and its splice variants. 
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Figure 8(a) Diagrammatic representation of genomic structure of human SOM175 
genomic showing exon/intron map. 

Figure 8(b) Diagrammatic representation of genomic structure of human SOM175 
5 showing exon/intron boundries. 

Figure 9 Nucleotide and predicted peptide sequences derived from mVRF cDNA 
clones. Numbering of nucleotides are given on the left, starting from the A of the 
initiation codon. Amino acids are numbered on the right, starting from the first residue 
0 of the predicted mature protein after the putative signal peptide has been removed. The 
alternately spliced region is double underlined and the resulting peptide sequence from 
each mRNA is included. A potential polyadenylation signal is indicated in boldface. 
Start and stop codons of mVRF, 67 and mVRF 186 are underlined and a polymorphic AC 
repeat in the 3' UTR is indicated by a stippled box. The positions of intron/exons 
5 boundaries are indicated by arrowheads. 

Figure 10 BESTFIT alignments of human and murine VRF protein isoforms. A: 
mVRF i67 ^ hVRF 167 . B: mVRF 186 and hVRF 186 from the point where the sequences 
diverge from the respective 1 67 amino acid isoforms. Amino acid identities are marked 
with vertical bars and conserved amino acids with colons. An arrow marks the predicted 
signal peptide cleavage site of human and mouse VRF. 

Figure 11 BESTFIT alignment of mVRF 167 and mVEGF 188 (Breier et at, 1992) 
peptide sequences. An arrow marks the signal peptide cleavage site of mVEGF. 
Identical amino acids are indicated by vertical bars and conservative substitutions by 
colons. Numbering of amino acids is as described in the legend to Figure 9. 

Figure 12 Comparison of gene structure between VRF (a generic VRF gene is 
shown since the intron/exon organisation of the mouse and human homologues is almost 
identical) and other members of the human VEGF/PIGF/PDGF gene family. Exons are 
represented by boxes. Protein coding regions and untranslated regions are shown by 
filled and open sections respectively. The hatched region in VRF indicates the 
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additional 3' UTR sequence formed by alternate splicing of the VRF lg6 isoform. 
Potential alternate splice products of each gene are shown. 

Figure 13 Autoradiogram of a Northern blot of total RNA from various adult mouse 
5 tissues (as indicated) hybridised with an mVRF cDNA clone. A major transcript of 1.3 
kb was detected in all samples. 

Figure 14 Film autoradiographs (A-C) and dark-field micrographs (D-E) illustrating 
the expression pattern of mVRF and mRNA in the mouse. In the E 14 mouse embryo 
(A) positive signals are present over the developing heart (Ha) and cerebral cortex (Cx). 
A low background signal is also present over other tissues in the section. In the El 7 
embryo (B) and the heart (Ha) is clearly visible due to a strong hybridisation signal. An 
equally strong signal is present over brown adipose tissue (Fa) in the back and around 
the thoracic cage. A moderate hybridisation signal is present over the spinal cord (SC) 
and the tongue (T). The background signal is reduced compared with the El 4 embryo. 
In the young adult mouse (C-D), positive signals are present over the heart (Ha) and 
adipose tissue (Fa) around the thoracic cage, while, for example, the lungs (Lu) are 
unlabeled). The hybridisation signal over the heart is evenly distributed over the entire 
left ventricle, including papillary muscles (D). In the E17 heart hybridised with an 
excess of cold probe, no positive signal is present (E). Scale bars = 0.5 mm (A), 1.2 
mm (B), 1 mm (C), 0.3 mm (D), 0. 1 mm (E). 

Figure 15 Dark - (A and C) and bright-field (B and D) micrographs showing mVRF 
mRNA expression in mouse adipose tissue (A-B) and spinal cord (C-D). A strong 
25 hybridisation signal is present over fat (A), as shown by the strong labeling in Sudan 
black stained sections (B). A weak signal is present also in skeletal muscle (M in A-B). 
In the adult spinal cord (C) the mVRF probes gave a neuronal staining pattern over the 
gray matter. Toloudine counterstaining showing that motoneurons in the ventral horn 
(D), interneurons in the deep part of the dorsal horn and around the central canal (not 
30 shown) where largely positive for mVRF mRNA. Scale bars = 0.1 mm (A), 0.1 mm 
(B), 0.25 mm (C), 0.015 mm (D). 



15 
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Figure 16 Effect of VEGF on embryonic day 8 (E8) chick sensory neurons as 
determined by % survival, % neurite outgrowth and average neurite length (urn). 

Figure 17 Effects of VEGF and SOM175 on chick glia. Tested were CNS glial, 
5 peripheral glia and CNS oligodendrocytes. 

Figure 18 Effect of various SOM175 proteins on mouse astroglial cells. ■ 3 H (cpm) 



1. 


FGF-2 (10 ng/ml) positive control 


2. 


SOMaX6* 1 ng/ml 


3. 


SOMaX6 10 ng/ml 


4. 


SOMaX6 100 ng/ml 


5. 


SOMaX6 1000 ng/ml 


6. 


SOMaX6 1000 ng/ml, no heparin 


7. 


SOMX6** 1 ng/ml 


8. 


SOMX6 10 ng/ml 


9. 


SOMX6 100 ng/ml 


10. 


SOMX6 1000 ng/ml 


11. 


SOMX6 1000 ng/ml, no heparin 



This refers to SOM175 absent exon 6; 
** This refers to SOM175. 



Figure 19 Effect of various SOM175 proteins on mouse oligodenroglial cells, 
(cpm) 

1. FGF-2 (10 ng/ml) positive control 

> 2. SOMaX6* 1 ng/ml 

3. SOMaX6 10 ng/ml 

4. SOMaX6 100 ng/ml 

5. SOMaX6 1000 ng/ml 

6. SOMaX6 1000 ng/ml., no heparin 
) 7. SOMX6** 1 ng/ml 

8. SOMX6 10 ng/ml 

9. SOMX6 100 ng/ml 
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10. SOMX6 1 000 ng/mJ 

1 1 . SOMX6 1 000 ng/ml, no heparin 
This refers to SOM175 absent exon 6; 

** This refers to SOM175. 

5 

Figure 20 Effect of various SOM175 proteins on mouse forebrain neurons. ■ % 



survival 




1. 


FGF-2 (10 ng/ml) positive control 


2. 


SOMaX6* 1 ng/ml 


3. 


SOMaX6 10 ng/ml 


4. 


SOMaX6 100 ng/ml 


5. 


SOMaX6 1000 ng/ml 


6. 


SOMaX6 1000 ng/ml, no heparin 


7. 


SOMX6** 1 ng/ml 


8. 


SOMX6 10 ng/ml 


9. 


SOMX6 100 ng/ml 


10. 


SOMX6 1000 ng/ml 


11. 


SOMX6 1000 ng/ml, no heparin 


This refers 


to SOM175 absent exon 6; 



20 ** This refers to SOM175. 



WO 96/27007 



PCT/AU96/00094 



TABLE 1 

SUMMARY OF SEQUENCE IDENTITY NUMBERS 



5 



SEQ ID NO.T 


Nucleotide sequence of VEGF 165 


SEQ ID NO:2 


Amino acid sequence of VEGF 165 


SEQ ID NO:3 


Nucleotide sequence of SOM175 (VEGF-Iike molecules) 


SEQ ID NO:4 


Amino acid sequence of SOM175 


SEQ ID NO:5 


Nucleotide sequence of SOM175 absent exon 6 


SEQ ID NO:6 


Amino acid sequence of SOM175 absent exon 6 


SEQ ID NO:7 


Nucleotide sequence of SOM175 absent exon 6 and exon 7 


SEQ ID NO:8 


Amino acid sequence of SOM175 absent exon 6 and exon 7 


SEQ ID NO:9 


Nucleotide sequence of SOM175 absent exon 4 


SEQ ID NO: 10 


Amino acid sequence of SOM175 absent exon 4 


SEQ ID NO: 11 


Oligonucleotide 


SEQ ID NO: 12 


Oligonucleotide 


SEQ ID NO: 13 


Oligonucleotide 


SEQ ID NO:14 


Oligonucleotide 



20 

EXAMPLE 1 

Human cDNA clones 

The original SOM175 cDNA was isolated by screening a human foetal brain library 
25 (TtzapII, Stratagene) with the cosmid D11S750 (Larsson et al, 1992). The plasmid was 
excised "in vivo" and a single 1.1 kb cDNA was obtained. Three independent SOM175 
cDNAs clones were also isolated from a human foetal spleen library (Strategane, Uni- 
zap) using the above-mentioned SOM[175 insert as a probe. Three clones were obtained: 
SOM175-4A, -5 A and -6A. SOM175-5A is an alternately spliced clone with exon 4 
30 being absent (SOM175-e4). These library screens were performed using hybridisation 
conditions recommended by the manufacturer of the library (Stratagene) and random 
primed insert of SOM175. 
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Two partial human SOM175 cDNAs have also isolated from a XGT\ 1 human melanoma 
cell line A2058 library (Clontech) cDNA library screens were performed using 
hybridisation conditions described by Church and Gilbert, 1984). In each case, the 
probe was generated by random priming of a PCR product derived from SOM175 (18f- 
5 700r). 

Mouse cDNA Clones 

Human SOM175 was also used to screen a mouse neonatal whole brain cDNA library 
(Unizap, Stratagene). Four non-chimeric clones were isolated: M175-A, B, C, D. All 
10 clones were partial cDNAs and M175-C contained several introns. Three of these 
cDNAs lacked the exon 6. 

Another clone referred to as Ml was completely sequenced and was found to contain 
the full open reading frame plus part of the 5'utr and total 3'utr. 

15 

EXAMPLE 2 
DNA SEQUENCE ANALYSIS 
The entire sequence of the cDNA clone (SOM175) was compiled and is shown in 
Figure 2 with its corresponding amino acid sequence. This sequence was screened for 
20 open reading frames using the MAP program (GCG, University of Wisconsin). A single 
open reading frame of 672bp was observed (see Figure 2). There appears to be little 5' 
untranslated sequences (2bp). The 3' untranslated region appears to be complete as it 
includes a poly-adenylation signal and poly-A tail. 

25 Database homology searches were performed using the BLAST algorithm (run at NCBI, 
USA). This analysis revealed homology to several mammalian forms of VEGF (see 
Figure 3). The amount of homology between SOM175 and human VEGF 165 was 
determined using the BESTFIT program (GCG, University of Wisconsin; see Figures 
4 and 5). Nucleotide homology was estimated at 69.7% and protein homology was 

30 estimated as at least 33.3% identity and 52.5% conservation using BESTFIT analysis. 
BLAST analysis on nucleotide sequences revealed the almost complete match to a 
human expressed sequence tag EST06302 (Adams et al, 1993). 
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These data indicate that SOM175 encodes a growth factor that has structural similarities 
to VEGF. Both genes show start and stop codons in similar positions and share discrete 
blocks of homology. All 8 cysteines as well as a number of other VEGF residues 
believed to be involved in dimerisation are conserved. These residues are Cysteine-47, 
5 Proline-70, Cysteine- 72, Valine- 74, Arginine-77, Cysteine-78, Glycine-80, Cysteine-8 1 , 
Cysteine-82, Cysteine-89, Proline-91, Cysteine- 122 and Cysteine- 124 and are shown in 
Figure 6. Given the structural conservation between VEGF and the SOM175 gene 
product it is also possible that they share functional similarities. It is proposed that 
SOM175 encodes a VEGF-like molecule that shares some properties with VEGF but has 
1 0 unique properties of its own. The nucleotide sequence and corresponding amino acid 
sequence of VEGF, 65 is shown in Figure 1. 

EXAMPLE 3 

The percentage similarity and divergence between VEGF 165 family and SOM175 family 
15 (protein) were analysed using the Clustal method, MegAlign Software, DNASTAR, 
Wisconsin. The results are shown in Tables 2.1 and 2.2. The alternatively spliced 
forms of SOM175 are abbreviated to SOM715-e6 where all of exon 6 is deleted; 
SOM715-e6 and 7 where all of exons 6 and 7 are deleted; and SOM175-e4 where all 
of exon 4 is deleted. The spliced form of SOM175 are shown in Figure 7. Genomic 
20 maps of SOM175 showing intron/exon boundaries are shown in Figure 8a and 8b. 
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Table 2.1 

A Percent nucleotide similarity between splice variants of SOM175 and 
5 human VEGF,« 





VEGF 165 SOML75 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 


VEGF 165 


*** 34.9 


39.7 


41.4 


37.0 


SOM175 




98.9 


95.1 


99.2 


SOM175-e6 






98.8 


84.0 


SOM175-e6<&7 








80.3 


SOM175-e4 








*** 



15 
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B Percent nucleotide divergence between splice variants of SOM175 and 
human VEGF 16 , 





VEGF 165 SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 


VEGF 165 


*** 41.7 


41.6 


41.7 


41.8 


SOM175 




0.2 


0.2 


0.0 


SOM175-e6 




*** 


0.0 


0.2 


SOM175-e6&7 








0.3 


SOM175-e4 











Table 2.2 

15 A Percent amino acid identity between splice variants of SOM175 and 
human VEGF,« 





VEGF 165 


SOM175 


SOM175-e6 


SOM175-e6&7 SOM175-e4 




+ ** 


31.4 


42.3 


33.5 


40.6 


SOM175 






74.7 


73.7 


99.1 


SOM175-e6 






*** 


76.8 


99.1 


SOM175-e6&7 








*** 


99.1 


SOM175-e4 










*** 
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B Percent amino acid divergence between splice variants of SOM175 and 
human VEGF 16S 
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VEGF 165 SOM175 


SOM175-e6 


SOM175-e6&7 


SOM175-e4 




VEGF 165 


*** 65.7 


55.4 


54.6 


57.4 




SOM175 




19.9 


4.2 


0.0 




SOM175-e6 




*** 


0.0 


0.0 


10 


SOM175-e6&7 








0.0 




SOM175-e4 











15 EXAMPLE 4 

BIO ASS AYS TO DETERMINE THE FUNCTION OF SOM175 

Assays are conducted to evaluate whether SOM175 has similar activities to VEGF on 
endothelial cell function, angiogenesis and wound healing. Other assays are 
performed based on the results of receptor binding distribution studies. 

20 

Assays of endothelial cell function 

Endothelial cell proliferation. Endothelial cell growth assays as described in Ferrara 
& Henzel (1989) and in Gospodarowicz et al (1989). 

25 Vascular permeability assay. This assay, which utilises the Miles test in guinea pigs, 
will be performed as described in Miles & Miles (1952). 

Cell adhesion assay. The influence of SOM175 on adhesion of polymorphs to 
endothelial cells is analysed. 

30 



Chemotaxis. This is performed using the standard Boyden chamber chemotaxis 
assay. 
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Plasminogen activator assay. Endothelial cells are tested for plasminogen activator 
and plasminogen activator inhibitor production upon addition of SOM175 (Pepper et 
al (1991)). 

5 Endothelial cell migration assay. The ability of SOM175 to stimulate endothelial 
cells to migrate and form tubes is assayed as described in Montesano et al (1986). 

Angiogenesis Assay 

SOM175 induction of an angiogenic response in chick chorioallantoic membrane is 
0 evaluated as described in Leung et al (1989). 

Possible neurotrophic actions of SOM175 are assessed using the following assays: 

Neurite outgrowth assay and gene induction (PC12 cells) 

5 PC 12 cells (a phaeochromocytoma cell line) respond to NGF and other neurotrophic 
factors by developing the characteristics of sympathetic neurons, including the 
induction of early and late genes and the extension of neurites. These cells are 
exposed to SOM175 and their response monitored (Drinkwater et al (1991); and 
Drinkwater et al (1993)). 

0 

Cultured neurons from the Peripheral Nervous System (PNS) 

Primary cultures of the following PNS neurons are exposed to SOM175 and 
monitored for any response: 

- sensory neurons from neural crest and dorsal root ganglia 
5 - sympathetic neurons from sympathetic chain ganglia 

- placode derived sensory neurons from nodose ganglia 

- motoneurons from spinal cord 

The assays are described in Suter et al (1992) and in Marinou et al (1992). 

0 Where an in vitro response is observed, in vivo assays for properties such as uptake 
and retrograde transport are performed as described in Hendry et al (1992). 
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Nerve regeneration (PNS) 

Where neurotrophic effects of SOM175 are observed, its possible role in the 
regeneration of axotomised sensory neurons, sympathetic neurons and motoneurons is 
analysed by the methods of Otto et al (1989); Yip et al (1984) and Hendry et al 
5 (1976). 

Actions of SOM175 on CNS neurons 

The ability of SOM175 to promote survival of central nervous system neurons is 
analysed as described in Hagg et al (1992); Williams et al (1986); Hefti (1986) and 
10 Kromer (1987). 

Wound Healing 

The ability of SOM175 to support wound healing are tested in the most clinically 
relevant model available, as described in Schilling et al (1959) and utilised by Hunt 
15 etal (1967). 

The Haemopoietic System 

A variety of in vitro and in vivo assays on specific cell populations of the 
haemopoietic system are available and are outlined below: 
20 Stem Cells 
Murine 

A variety of novel in vitro murine stem cell assays have been developed using 
FACS-purified cells: 

25 (a) Repopulating Stem Cells 

These are cells capable of repopulating the bone marrow of lethally irradiated mice, 
and have the Lin", Rh hl , Ly-6A/E + , c-kit + phenotype. The test substance is tested on 
these cells either alone, or by co-incubation with multiple factors, followed by 
measurement of cellular proliferation by 3 H thymidine incorporation. 
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(b) Late Stage Stem Cells 

These are cells that have comparatively little bone marrow repopulating ability but 
can generate D13 CFU-S. These cells have the Lin", Rh hi , Ly-6A/E + , c-kit + 
5 phenotype. The test substance is incubated with these cells for a period of time, 
injected into lethally irradiated recipients, and the number of D13 spleen colonies 
enumerated. 

(c) Progenitor-Enriched Cells 

10 These are cells that respond in vitro to single growth factors, and have the Lin", Rh hl , 
Ly-6A/E + , c-ldf phenotype. This assay will show if SOM175 can act directly on 
haemopoietic progenitor cells. The test substance is incubated with these cells in 
agar cultures, and the number of colonies enumerated after 7-14 days. 

1 5 Atherosclerosis 

Smooth muscle cells play a crucial role in the development or initiation of 
atherosclerosis, requiring a change in their phenotype from a contractile to a synthetic 
state. Macrophages, endothelial cells, T lymphocytes and platelets all play a role in 
the development of atherosclerotic plaques by influencing the growth and phenotypic 
20 modulations of smooth muscle cell. An in vitro assay that measures the proliferative 
rate and phenotypic modulations of smooth muscle cells in a multicellular 
environment is used to assess the effect of SOM175 on smooth muscle cells. The 
system uses a modified Rose chamber in which different cell types are seeded onto 
opposite coverslips. 

25 

Effects of SOM175 on bone 

The ability of SOM175 to regulate proliferation of osteoblasts is assayed as described 
in Lowe et al (1991). Any effects on bone resorption are assayed as described in 
Lowe et al (1991). Effects on osteoblast migration and changes in intracellular 
30 molecules (e.g. cAMP accumulation, alkaline phosphatase levels) are analysed as 
described in Midy et al (1994). 
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Effects on skeletal muscle cells 

Effects of SOM175 on proliferation of myoblasts and development of myotubes can 
be determined as described by Ewton et a! (1980) and by Gospodarowicz et al 
(1976). 

EXAMPLE 5 
CLONING MURINE VEGF DNA 

Isolation of cDNAs 

Murine VRF (mVRF) clones were selected from a lambda Zap new born whole brain 
cDNA library (Stratagene). Primary' phage from high density filters (5 x 10 4 
pfu/plate) were identified by hybridisation with a 682bp 32 P-labelled probe generated 
by PCR from an hVRF cDNA (pSOM175) as described above. Hybridisation and 
stringent washes of nylon membranes (Hybond-N) were carried out at 65°C under 
conditions described by Church and Gilbert (1984). Positive plaques were picked, 
purified and excised in vivo to produce bacterial colonies containing cDNA clones in 
pBluescript SK-. 

Isolation of genomic clones 

Genomic clones were isolated from a mouse strain SV/129 library cloned in the 
lambda Fix II vector (Stratagene). High density filters (5 x 10 4 pfu/filter) were 
screened with a 563 bp 32 P-labelled probe generated by PCR amplification of the 
nucleotide 233-798 region of the mVRF cDNA (see Figure 9). Positive clones were 
plugged and re-screened with filters containing 400-800 pfu. Large scale phage 
preparations were prepared using the QIAGEN lambda kit or by ZnCl 2 purification 
(Santos, 1991). 

Nucleotide sequencing and analysis 

cDNAs were sequenced on both strands using a variety of vector-based and internal 
primers with Applied Biosystems Incorporated (ABI) dye terminator sequencing kits 
according to the manufacturer's specifications. Sequences were analysed on an ABI 
Model 3 73 A automated DNA sequencer. Peptide homology alignments were 
performed using the program BESTFIT (GCG, Wisconsin). 
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Identification of intron/exon boundaries 

Identification of exon boundaries and flanking regions was carried out using PCR 
with mouse genomic DNA or mVRF genomic lambda clones as templates. The 
primers used in PCR to identify introns were derived from the hVRF sequence and to 
5 allow for potential human-mouse sequence mismatches annealing temperatures 5- 
10°C below the estimated T m were used. All PCR products were sized by agarose 
gel electrophoresis and gel purified using QIAquick spin columns (Qiagen) and the 
intron/exon boundaries were sequenced directly from these products. In addition, 
some splice junctions were sequenced from subcloned genomic fragments of MVRF. 
10 Intron/exon boundaries were identified by comparing cDNA and genomic DNA 
sequences. 

Northern analysis 

Total cellular RNA was prepared from a panel of fresh normal adult mouse tisues 
15 (brain, kidney, liver, muscle) using the method of Chomczynski and Sacchi (1987). 
20ug of total RNA were electrophoresed, transferred to a nylon membrane (Hybond 
N, Amersham) and hybridised under standard conditions (Church & Gilbert, 1984). 
Filters were washed at 65°C in O.lxSSC (20xSSC is 3M NaCl/0.3M trisodium 
citrate), 0.1% SDS and exposed to X-ray film with intensifying screens at -70°C for 
20 1-3 days. 

Characterisation of mVRF cDNAs 

Murine VRF homologues were isolated by screening a murine cDNA library with an 
hVRF cDNA clone. Five clones of sizes varying from 0.8-1.5 kb were recovered 
25 and sequenced. The cDNA sequences were complied to give a full length 1041 bp 
cDNA sequence covering the entire open reading frame (621 bp or 564 bp depending 
on the splice form, see below) and 3' UTR (379 bp), as well as 163 bp of the 5' 
UTR (Figure 9). 

30 The predicted initiation codon matched the position of the start codon in hVRF. One 
other out of frame ATG was located at position -47 and two termination codons were 
observed upstream (positions -9 and -33, respectively) and in-frame with the putative 
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initiation codon. 

The predicted N-terminal signal peptide of hVRF appears to be present in mVRF 
with 81% identity (17/21 amino acids). Peptide cleavage within mVRF is expected 
to occur after reside 21 (Figure 10), These data suggest that mature mVRF is 
secreted and could therefore conceivably function as a growth factor. 

As with hVRF, two open reading frames (ORFs) were detected in cDNAs isolated by 
library screening. Four of five clones were found to be alternatively spliced and 
lacked a 101 bp fragment homologous to exon 6 of hVRF. The predicted peptide 
sequences of the two isoforms of mVRF were determined and aligned with the 
corresponding human isoforms (Figure 10). 

The message encoding mVRF !86 contains a 621 bp ORF with coding sequences 
terminating at position +622, towards the end of exon 7 (Figure 9). The smaller 
message encoding mVRF 167 actually terminates downstream of the +622 TAG site 
due to a frame shift resulting from splicing out of the 101 bp exon 6 and the 
introduction of a stop codon (TGA) at position +666, near the beginning of exon 8 
(Figure 9). 

The mVRF lg6 protein has strong homology to the amino and central portions of 
VEGF while the carboxyl end is completely divergent an is alanine rich. mVRF 167 
possesses these similarities and also maintains homology to mVEGF right through to 
the C-terminus (Figure 1 1). The overall homology of mVRF 167 to hVRF 167 was 
85% identity and 92% similarity, respectively (Figure 10). Likewise, homology 
between mVRF 167 and mVEGF (Breier et a!, 1992) was 49% identity and 71% 
conservative amino acid substitution, respectively (Figure 11). 

A canonical vertebrate polyadenylation signal (AATAAA) (Birnstiel et al, 1986) was 
not present in the mVRF cDNA, however, the closely matching sequence GAT AAA 
is present at similar positions in both mouse and human VRF cDNAs (Figure 9). In 
contrast to hVRF, mVRF was found to contain an AC dinucleotide repeat at the 
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extreme 3' end of the 3' UTR (nucleotide positions 998 to 1011, Figure 9). 
Polymorphism of this repeat region was observed between some of the mVRF 
cDNAs, with the number of dinucleotides varying from 7 to 11. 

5 Genomic characterisation of mVRF 

Intron/exon boundaries (Table 3) were mapped using primers which flanked 
sequences homologous to the corresponding hVRF boundaries. Introns I, III, IV and 
VI of mVRF (Table 3, Figure 12) were smaller than the hVRF intervening 
sequences. The complete genomic sequence was compiled from the 5' UTR of 

10 mVRF through to intron VI, the largest intervening region (2.2 kb), by sequencing 
amplified introns and cloned genomic portions of mVRF. There was only one major 
difference in genomic structure between mVRF and hVRF and that was the exon 
7/intron VI boundary of mVRF was located 1 Obp further downstream in relation to 
the cDNA sequence, hence exon 7 in mVRF is lObp longer than the corresponding 

15 exon in hVRF. 

Exons 6 and 7 are contiguous in mVRF, as has been found to occur in the human 
homologue. The strong sequence homology between exon 6 of mVRF and hVRF 
(Figure 1 0) suggests that this sequence is not a retained intronic sequence but rather 
20 encodes a functional part of the VRF, 86 isoform. 

General intron/exon structure is conserved between the various members of the 
VEGF gene family (VEGF, PIGF, hVRF) and therefore it is not surprising that the 
overall genomic organisation of the mVRF gene is very similar to these genes 
25 (Figure 12). 

Previous comparative mapping studies have shown that the region surrounding the 
human multiple endocrine neoplasia type 1 disease locus on chromosome llql3 is 
syntenic with the proximal segment of mouse chromosome 19 (Rochelle et al, 1992). 
30 Since the inventors have mapped the hVRF gene to within Ikb of the human MEN1 
locus (see above) it is most likely that the murine VRF gene maps near the 
centromere of chromosome 19. 
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Expression studies of mVRF 

Northern analysis of RNA from adult mouse tissues (muscle, heart, lung and liver) 
showed that expression appears to be ubiquitous and occurs primarily as a major 
band of approximately 1.3kb in size (Figure 14). This is somewhat different to the 
5 pattern observed for hVRF in which two major bands of 2.0 and 5.5 kb have been 
identified in all tissues examined. The 1.3 kb murine message presumably 
corresponds to the shorter of the human transcripts and the size variation thereof is 
most likely due to a difference in the length of the respective 5' UTRs. 

10 EXAMPLE 6 

EXPRESSION OF MURINE VEGF IN PRE- AND POST-NATAL MOUSE 
Animals 

Timed pregnant (n=4) and young adult (n=2) mice (C57 inbred strain, ALAB, 
Sweden) were sacrificed with carbon dioxide, and the relevant tissues were taken out 
15 and frozen on a chuck. Tissues were kept at -70°C until further use. Two 
gestational ages was used in this study; embryonic day 8 (E8), 14 and £17. 

In situ hybridisation histochemistry 

In situ hybridisation was performed as previously described (Dagerlind et al, 1992). 
20 Briefly, transverse sections (14um) were cut in a cryostat (Microm, Germany), 
thawed onto Probe-On slides (Fisher Scientific, USA) and stored in black sealed 
boxes at -70°C until used. The sequences of the synthetic 42-mer oligonucleotides 
complementary to mRNA encoding mVRF were 

ACCACCACCTCCCTGGGCTGGCATGTGGCACGTGCATAAACG [SEQ ID 

25 NO:ll] (complementary to nt 120-161) and 

AGTTGTTTGACCACATTOCCCATGAGTTCCATGCTCAGAGGC [SEQ ID 
NO: 12] (complementary to nt 162-203). To detect the two alternative splice forms 
oligonucleotide GATCCTGGGGCTGGAGTGGGATGGATGATGTCAGCTGG [SEQ 
ID NO: 13] (complementary to nt xxx-xxx) and 

30 GCGGGCAGAGGATCCTGGGGCTGTCTGGCCTCACAGCACT [SEQ ID NO: 14] 
were used. The probes were labeled at the 3 '-end with deoxyadenosine- 
alpha[thio]triphosphate [ 35 S] (NEN, USA) using terminal deoxynucleotidyl 
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transferase (TBI, USA) to a specific activity of 7-10 x 10 8 cpm/ug and hybridised to 
the sections without pretreatment for 16-18 h at 42°C. The hybridisation mixture 
contained: 50% v/v formamide, 4 x SSC (1 x SSC = 0.15M NaCl and 0.015M 
sodium-citrate), 1 x Denhardt's solution (0.02% each of polyvinyl-pyrrolidone, BSA 
5 and FicollV 1% v/v sarcosyl (N-lauroylsarcosine; Sigma), 0.02M phosphate buffer 
(pH 7.0), 1 0% w/v dextran sulfate (Pharmacia, Sweden), 250ug/ml yeast tRNA 
(Sigma), 500ug/ml sheared and heat denatured salmon sperm DNA (Sigma) and 
200mM dithiothreitol (DTT; LKB, Sweden). In control sections, the specificity of 
both probes was checked by adding a 20-fold excess of unlabeled probe to the 
10 hybridisation mixture. In addition, adjacent sections were hybridised with a probe 
unrelated to this study which gave a different expression pattern. Following 
hybridisation the sections were washed several times in 1 x SSC at 55°C, dehydrated 
in ethanol and dipped in NTB2 nuclear track emulsion (Kodak, USA). After 3-5 
weeks the sections were development in D-19 developer (Kodak, USA) and cover- 
15 slipped. In some cases, sections were opposed to an autoradiographic Film (Beta-max 
autoradiography film Amersham Ltd, UK) prior to emulsion-dipping. 

The four different probes gave identical hybridisation patterns in all tissues examined. 
Mouse VRF expression was detecting already in the E8 embryo, in which positive 

20 signal was recorded over structures most likely corresponding to the neuronal tube. 
In sagittal sections of El 4 mouse embryo the strongest hybridisation signal was 
present over heart and in the nervous system, especially cerebral cortex (Figure 14A). 
A low level of expression was present in all other tissues. At a later gestational age, 
El 7, a high mVRF mRNA signal was confined to he heart and brown fat tissue in 

25 the back and around the neck (Figure 14B). Clearly positive hybridisation signals 
were present in the gray of the spinal cord and in the tongue (Figure 14B). 
Expression in the cerebral cortex was clearly reduced compared to day 14. The weak 
background expression seen in the E14 embryo in for example muscle, had decreased 
at this gestational age. A strong mVRF mRNA hybridisation signal was present 

30 solely over the heart and in the brown fat in the young adult mice (Figure 14C). The 
signal over the heart was evenly distributed ove the entire ventricular wall, including 
the papillary muscles (Figure 14D). In sections of heart tissue hybridised with an 
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excess of cold probe, no specific labeling over background signal was recorded 
(Figure 14E). 

Apart from the heart, mVRF mRNA signal was present over certain tissues on the 
5 outside of the thoracic cage that morphologically resembled brown fat. This was 
verified with sudan black counterstaining, which showed a strong staining in the 
same areas (Figure 15A and 15B). In transverse sections of adult mouse spinal cord, 
the mVRF probes gave a neuronal staining pattern over the gray matter (Figure 15C). 
Counterstaining with toluidine (Figure 15D) showed that motoneurons in the ventral 
10 horn (Figure 15C and 15D), interneurons (Figure 15C) in the deep part of the dorsal 
horn and around the central canal where to a large extent positive for mVRF mRNA. 

EXAMPLE 7 

EFFECTS OF VEGF AND SOM175 PROTEINS ON CHICK 
15 SENSORY NEURONS 

The effects of VEGF and SOM175 proteins on embryonic day 8 chick sensory 
neurons were determined using the method of Nurcombe et al (1992). The neuronal 
assay was read at 48 hours using 2000 cells per assay well. The results were 
obtained using 3 H-thymidine counts. The percentage survival of neurons, neurite 
20 outgrowth and average neurite length in am were determined using NGF as positive 
control and various concentrations of VEGF, VEGF in the presence of heparin and 
VEGF in the presence of heparin and 5 uM, 5'-flurouracil (5FU). 5FU kills glial 
cells. 

25 The results are shown in Figure 16. The results show that VEGF is effective in 

promoting neuronal survival but that this requires the presence of glial cells. Figure 
17 shows the results of the effect of VEGF and SOM175 on three types of chick 
glia. The glia tested were CNS gliai, peripheral glia and CNS oligodendrocytes. 
Heparin was used as 10 ug/ml in all cultures and the assay was read at 24 hours. 

30 Results were measured in 3 H-thymidine counts using 2000 cells per well. 
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The results show that for chick.central and peripheral neurons, astroglia were 
markedly stimulated to proliferate by SOM1 75 in the presence of heparin but that 
chick oligodendrocytes showed negligible increase in the rate of division. 

5 

EXAMPLE 8 

EFFECTS OF SOM175 PROTEINS ON MOUSE PRIMARY AND 
CENTRAL NEURONS 

The results in Example 7 show that VEGF isoform had an effect on chick primary 
10 and central neurons through the agency of the astroglial cells. Similar experiments 
were repeated in mouse cells. 

Culture conditions 

Neuronal and gligal cells for all in vitro experiments were prepared and cultured 
5 according o the techniques described in "Methods in Neurosciences (Vol. 2): Cell 
Culture" Ed. P.M. Conn, Academic Press, San Diego, 1990, pp33-46 for astroglial 
cells, pp56-74 for oligodendroglial cells, and pp87-102 for central neurons. 

Cells were plated onto 24-well culture clusters (Nunc) coated with poly-L-ornithine 
(0.1 mg/ml, Ih) at a density of 2,000 cells/well. After 48 hours in culture, neurons 
were counted in the wells under inverted phase light using well established 
techniques (Maruta et al. 1993) and glial cells assessed with [ 3 H]thymidine uptake to 
monitor cell division rates as below. Heparin (lOug/ml, low molecular weight 
fraction, Sigma Chemical Corp.) was present at all times in the culture media except 
where noted. The neuronal cultures were supplemented with 5mM 5-fluoro-2- 
deoxyuridine (Sigma) to suppress background glial growth. 

3 H-Thymidine incorporation assay for glial cell proliferation 

The cells were pulsed for 14h with 3 H-thymidine (specific activity 103 p.Ci/ug) fraom 
a stock concentration of 0.1 mCi/ml in standard medium, giving a final incubating 
volume of 20 ul/well. The contents of the wells were harvested and absorbed onto 
nitrocellulose paper (Titertek, Flow). Remaining adherent cells were removed by 



WO 96/27007 



PCT/ATJ96/00094 



- 31 - 

incubation with trypsin/versene (CSL Limited, Victoria, Australia) for 5 min. This 
procedure was carried out twice. The nitrocellulose discs were washed in a standard 
Titertek harvester (Flow) using first distilled water, and then methanol. The 
nitrocellulose discs were dried, scintillation fluid (containing 5% v/v Triton-X) added 
5 and the discs counted on a scintillation counter. 

Greatest activity was seen with preparations of SOM175 absent exon 6 (SOMaX6) 
on mouse astroglial cell cultures, where there was a significant stimulus to their 
proliferation when delivered in conjunction with heparin (Figure 16). Little stimulus 
10 was given to the proliferation of oligodendroglial cells (Figure 17), and very little 
discernable potentiation of the survival response of isolated forebrain neurons (Figure 
18). The standard deviation on all three graphs for each point was less than 8%. 

The viability of neurons can be maintained by promoting glial cell proliferation. 
15 Furthermore, SOMaX6 is a good inducer of astroglial proliferation and may be 

expressed in conjunction with the formation of astroglial endfeet on central nervous 
system endothelial cells. 

Those skilled in the art will appreciate that the invention described herein is 
20 susceptible to variations and modifications other than those specifically described. It 
is to be understood that the invention includes all such variations and modifications. 
The invention also includes all of the steps, features, compositions and compounds 
referred to or indicated in this specification, individually or collectively, and any and 
all combinations of any two or more of said steps or features. 



25 
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TABLE 3 

Splice junctions of the murine VRF gene 

5 5' UTR* Exon 1 >223bp CCCAGgtacgtgcgt Intron I 495bp 

ttccccacagGCCCC Exon 2 43bp GAAAGgtaataatag Intron II 288bp 

ctgcccacagTGGTG Exon 3 197bp TGCAGgtaccagggc Intron III 196bp 

ctgagcacagATCCT Exon 4 74bp TGCAGgtgccagccc Intron IV 182bp 

ctcttttcagACCTA Exon 5 36bp GACAGattcttggtg Intron V 191 bp 

10 ctcctcctagGGTTG Exon 6 lOlbp (no intron) 

CCCACTCCAGCCCCA Exon 7 135bp TGTAGgtaaggagtc Intron VI ~2200bp 

cactccccagGTGCC Exon 8 394bp AGAGATGGAGACACT 



15 



Uppercase and lowercase letters denote exonic and intronic sequences respectively. 
* Indicates that the 5' end of exon 1 has not yet been determined. 
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(i) APPLICANT: 

(countries other than US) AMRAD OPERATIONS PTY. LTD. 
(us only) Hayward, N and Weber, G 

(ii) TITLE OF INVENTION: A NOVEL GROWTH FACTOR AND 
GENETIC SEQUENCE ENCODING SAME 

(iii) NUMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DA VIES COLLISON CAVE 

(B) STREET: 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
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(B) FILING DATE: 22-FEB-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: AU PN1457 

(B) FILING DATE: 02-MAR-1995 

(A) APPLICATION NUMBER: AU PN6647 

(B) FILING DATE: 20-NOV-1995 

(A) APPLICATION NUMBER: AU PN7274 

(B) FILING DATE: 22-DEC-1995 
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(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES DR, E JOHN L 
(C) REFERENCE/DOCKET NUMBER: EJH/EK 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 17... 589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



TCGGGCCTCC GAAACC ATG AAC TTT CTG CTG TCT TGG GTG CAT TGG AGC 
Met Asn Phe Leu Leu Ser Trp Val His Trp Ser 



CTT GCC TTG CTG CTC TAC CTC CAC CAT GCC AAG TGG TCC CAG GCT GCA 
Leu Ala Leu Leu Leu Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala 



CCC ATG GCA GAA GGA GGA GGG CAG AAT CAT CAC GAA GTG GTG AAG TTC 
Pro Met Ala Glu Gly Gly Gly Gin Asn His His Glu Val Val Lys Phe 



ATG GAT GTC TAT CAG CGC AGC TAC TGC CAT CCA ATC GAG ACC CTG GTG 
Met Asp Val Tyr Gin Arg Ser Tyr Cys His Pro He Glu Thr Leu Val 



GAC ATC TTC CAG GAG TAC CCT GAT GAG ATC GAG TAC ATC TTC AAG CCA 
Asp He Phe Gin Glu Tyr Pro Asp Glu He Glu Tyr He Phe Lys Pro 



TCC 
Ser 



TGT GTG CCC CTG ATG CGA TGC GGG GGC TGC TGC AAT GAC GAG GGC 
Cys Val Pro Leu Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly 
80 85 90 



289 
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CTG GAG TGT GTG CCC ACT GAG GAG TCC AAC ATC ACC ATG CAG ATT ATG 
Leu Glu Cys Val Pro Thr Glu Glu Ser Asn lie Thr Met Gin lie Met 
95 100 105 

CGG ATC AAA CCT CAC CAA GGC CAG CAC ATA GGA GAG ATG AGC TTC CTA 
Arg lie Lys Pro His Gin Gly Gin His lie Gly Glu Met Ser Phe Leu 
110 115 120 

CAG CAC AAC AAA TGT GAA TGC AGA CCA AAG AAA GAT AGA GCA AGA CAA 
Gin His Asn Lys Cys Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin 
125 130 135 

GAA AAT CCC TGT GGG CCT TGC TCA GAG CGG AGA AAG CAT TTG TTT GTA 
Glu Asn Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val 
140 145 150 155 

CAA GAT CCG CAG ACG TGT AAA TGT TCC TGC AAA AAC ACA GAC TCG CGT 
Gin Asp Pro Gin Thr Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg 
160 165 170 

TGC AAG GCG AGG CAG CTT GAG TTA AAC GAA CGT ACT TGC AGA TGT GAC 
Cys Lys Ala Arg Gin Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp 
175 180 185 

AAG CCG AGG CGG TGAGCCGGGC AGGAGGAAGG AGCCTCCCTC AGCGTTTCGG 
Lys Pro Arg Arg 
190 

GAACCAGATC TCTCACCAGG 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 191 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu 
15 10 15 

Tyr Leu His His Ala Lys Trp Ser Gin Ala Ala Pro Met Ala Glu Gly 
20 25 30 

Gly Gly Gin Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gin 



Arg Ser Tyr Cys His Pro lie Glu Thr Leu Val Asp lie Phe Gin Glu 
50 55 60 
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Tyr Pro Asp Glu lie Glu Tyr lie Phe Lys Pro Ser Cys Val Pro Leu 
65 70 75 80 

Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro 
85 90 95 

Thr Glu Glu Ser Asn lie Thr Met Gin lie Met Arg lie Lys Pro His 
100 105 110 

Gin Gly Gin His lie Gly Glu Met Ser Phe Leu Gin His Asn Lys Cys 
115 120 125 

Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gin Glu Asn Pro Cys Gly 
130 135 140 

Pro Cys Ser Glu Arg Arg Lys His Leu Phe Val Gin Asp Pro Gin Thr 
145 150 155 160 

Cys Lys Cys Ser Cys Lys Asn Thr Asp Ser Arg Cys Lys Ala Arg Gin 
165 170 175 

Leu Glu Leu Asn Glu Arg Thr Cys Arg Cys Asp Lys Pro Arg Arg 
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 624 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp He Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 
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CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 

GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 2 39 

Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 
65 70 75 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 287 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 
80 85 90 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 33 5 

Gin Val Arg Met Gin He Leu Met He Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 38 3 

Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAC AGG GCT GCC ACT CCC CAC CAC 431 
Lys Lys Asp Ser Ala Val Lys Pro Asp Arg Ala Ala Thr Pro His His 
130 135 140 

CGT CCC CAG CCC CGT TCT GTT CCG GGC TGG GAC TCT GCC CCC GGA GCA 479 
Arg Pro Gin Pro Arg Ser Val Pro Gly Trp Asp Ser Ala Pro Gly Ala 
145 150 155 

CCC TCC CCA GCT GAC ATC ACC CAT CCC ACT CCA GCC CCA GGC CCC TCT 52 7 

Pro Ser Pro Ala Asp He Thr His Pro Thr Pro Ala Pro Gly Pro Ser 
160 165 170 175 

GCC CAC GCT GCA CCC AGC ACC ACC AGC GCC CTG ACC CCC GGA CCT GCC 57 5 

Ala His Ala Ala Pro Ser Thr Thr Ser Ala Leu Thr Pro Gly Pro Ala 
180 185 190 

GCT GCC GCT GCC GAC GCC GCA GCT TCC TCC GTT GCC AAG GGC GGG GCT T 62 4 

Ala Ala Ala Ala Asp Ala Ala Ala Ser Ser Val Ala Lys Gly Gly Ala 
195 200 205 

AGAGCTCAAC CCAGACACCT GCAGGTGCCG GAAGCTGCGA AGGTGACACA TGGCTTTTCA 684 

GACTCAGCAG GGTGACTTGC CTCAGAGGCT ATATCCCAGT GGGGGAACAA AGGGGAGCCT 744 

GGTAAAAAAC AGCCAAGCCC CCAAGACCTC AGCCCAGGCA GAAGCTGCTC TAG GAC CTGG 8 04 

GCCTCTCAGA GGGCTCTTCT GCCATCCCTT GTCTCCCTGA GGCCATCATC AAACAGGACA 8 64 

GAG TTGG AAG AGGAGACTGG GAGGCAGCAA GAGGGGTCAC ATACCAGCTC AGGGGAGAAT 924 

GGAGTACTGT CTCAGTTTCT AACCACTCTG TGCAAGTAAG CATCTTACAA CTGGCTCTTC 984 

CTCCCCTCAC TAAGAAGACC CAAACCTCTG CATAATGGGA TTTGGGCTTT GGTACAAGAA 1044 



CTGTGACCCC CAACCCTGAT AAAAGAGATG GAAGGAAAAA AAAAAAAAAA 



1094 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 07 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



Met Ser Pro Leu Leu 
1 5 

Ala Pro Ala Gin Ala 
20 

Arg Lys Val Val Ser 
35 

Pro Arg Glu Val Val 
50 

Ala Lys Gin Leu Val 
65 

Cys Cys Pro Asp Asp 
85 

Val Arg Met Gin He 
100 

Glu Met Ser Leu Glu 
115 

Lys Asp Ser Ala Val 
130 

Pro Gin Pro Arg Ser 
145 

Ser Pro Ala Asp He 
165 

His Ala Ala Pro Ser 
180 

Ala Ala Ala Asp Ala 
195 



Arg Arg Leu Leu Leu Ala 

10 

Pro Val Ser Gin Pro Asp 
25 

Trp He Asp Val Tyr Thr 
4 0 

Val Pro Leu Thr Val Glu 
55 

Pro Ser Cys Val Thr Val 
70 75 

Gly Leu Glu Cys Val Pro 
90 

Leu Met He Arg Tyr Pro 
105 

Glu His Ser Gin Cys Glu 
120 

Lys Pro Asp Arg Ala Ala 
135 

Val Pro Gly Trp Asp Ser 
150 155 

Thr His Pro Thr Pro Ala 
170 

Thr Thr Ser Ala Leu Thr 
185 

Ala Ala Ser Ser Val Ala 
200 



Ala Leu Leu Gin Leu 

15 

Ala Pro Gly His Gin 
30 

Arg Ala Thr Cys Gin 
45 

Leu Met Gly Thr Val 
60 

Gin Arg Cys Gly Gly 
80 

Thr Gly Gin His Gin 
95 

Ser Ser Gin Leu Gly 
110 

Cys Arg Pro Lys Lys 
125 

Thr Pro His His Arg 
140 

Ala Pro Gly Ala Pro 
160 

Pro Gly Pro Ser Ala 
175 

Pro Gly Pro Ala Ala 
190 

Lys Gly Gly Ala 
205 
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(2) INFORMATION rOk SEQ ID NO:S: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . 566 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 



CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 



CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 
Gin Arg Lys Val Val Ser Trp He Asp Val Tyr Thr Arg Ala Thr Cys 



CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 



GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 
Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 



GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 



CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAG CTG 
Gin Val Arg Met Gin He Leu Met He Arg Tyr Pro Ser Ser Gin Leu 
100 105 110 

GGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 
Gly Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys 
115 120 125 

AAA AAG GAC AGT GCT GTG AAG CCA GAT AGC CCC AGG CCC CTC TGC CCA 
Lys Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro 
130 135 140 
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CGC TGC ACC CAG CAC CAC CAG CGC CCT CAC CCC CGG ACC TGC CGC TGC 4 79 

Arc? Cys Thr Gin Hia Hia Gin Arg Pro Asp Pro Arg Thr Cys Arg Cyn 
145 150 155 

CGC TGC CGA CGC CGC AGC TTC CTC CGT TCC CAA GCG CCC GGC TTA GAC 521 
Arg Cys Arg Arg Arg Ser Phe Leu Arg eye cln Cly Arg Gly L<:u Glu 
160 16S 170 175 

CTC AAC CCA GAC ACC TGC AGG TGC CGG AAG CTG CGA AGG TGACACATGG 5,7 6 
Leu Aan Pro Aop Thr Cya Arg Cya Arg Lya Leu Arq Arg 
180 IBS 

CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGACGCTATA TCCCAGTOGG GGAACAAAGG 63 6 

GGAGCCTGGT AAAAAACAGC CAAGCCCCCA AGACCTCAGC CCAGCCAGAA GCTGCTCTAG fi^fi 

GACCTGGGCC 7CTCAGAGGG CTCTTCTGCC ATCCCTTGTC TCCCTGACWC CATCATCAAA 7S6 

CAGCACAGAG TTGGAAGAGG AGACTGGGAG GCAGCAAGAG GGGTCACATA CCACCTCAGG 816 

CGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT CTTACAACTG 87S 

QCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTOCAT AATOGGATTT CCGCTTTCCT P3S 

ACAAGAACTG TGACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA AAAAAAA <>.n 



(2) INFORMATION FOR SE0 ID )JO:6: 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1R9 Amino Actidn 
(D) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii! MOLECITLE TYPE: protein 

(xi) SEQUENCE DESCRIPTION ; SEQ ID HO : G : 



Mec Ser Pro Leu Leu Arg Arg Leu L*u L»u Ala A1a I>U t.cu Gin Lr.u 

1 5 ID IS 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Ajp Ala Pro Gly Hio Gin 

20 25 30 

Axg Lya Val Val S-r Trp lie Asp Val Tyr Thr Arg Ala Thr Cya Gin 

35 40 45 

Pro Arq Glu Val Val Val Pro Leu Thr Val Glu Leu Mat Gly Thr Val 

50 55 60 

Ala Lya Gin Lou Val Pro Ser Cyo Val Thr v»l Gin Arg Cyn Oly Oly 
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Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin lie Leu Met He Arg Tyr Pro Ser Ser Gin Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys Lys 
115 120 125 

Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro Arg 
130 135 140 

Cys Thr Gin His His Gin Arg Pro Asp Pro Arg Thr Cys Arg Cys Arg 
145 150 155 160 

Cys Arg Arg Arg Ser Phe Leu Arg Cys Gin Gly Arg Gly Leu Glu Leu 
165 170 175 

Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 
180 185 



(2) INFORMATION FOR SEQ ID NO: 7: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . .431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 4 7 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 
15 10 15 

CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 95 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 
20 25 30 

CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 143 
Gin Arg Lys Val Val Ser Trp He Asp Val Tyr Thr Arg Ala Thr Cys 
35 40 45 

CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 191 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 
50 55 60 
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GTG GCC AAA CAG CTG GTG CCC ACC TGC GTG ACT GTG CAG COC TGT CGT 
Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Vnl Gin Ar a Cyn Gly 
SS 70 7S 

GGC TGC TGC CCT GAC GAT GGC CTG GAG TCT GTG CCC ACT CGG CAG CkC 
Gly Cy« Cys Pro A.ip Aup Gly Leu Glu Cys Vj 1 Pro Thr Gly Cln Hio 
60 85 .10 95 

CAA GTC CGG ATG CAG ATC CTC ATG ATC CGG TAC CCG AGC AGT CAC CTG 
Gin VaI Arg Mrt Gin lie Leu Me.C lift Arcf Tyr rro S?r Ser Gin Leu 
100 105 no 

CGG GAG ATG TCC CTG GAA GAA CAC AGC CAG TGT GAA TGC AGA CCT AAA 

Gly Glu Met Ser Leu Glu Glu Jlis Ser Gin Cyn Glu Cyn Arg Pro Lys 
US 120 125 • 

AAA AAG CAC AGT CCT GTG AAG CCA GAT AGG TGC CGG AAG CTG CGA ACC 

Lys Lys Asp Ser Ala Val Lya Pro Asp Arg Cy.i Arg Lyo Leu Arg Arg 

130 135 140 

TGACACATGG CTTTTCAGAC TCAGCAGGGT GACTTGCCTC AGAGGCTATA TCCCAGTGGG 
GGAACAAAGG GGAGCCTGGT AAAAAACAGC CAACCCCCCA AOACCTCACC CCACCCACAA 
OCTGCTCTAG GACCTGGGCC TCTCAGAGGG CTCTTCTGCC ATCCCTTCTC TCCCTGZGGC 
CATCATCAAA CAGGACAGAG TTCGAAGAGG ACACTGGGAG QCAGCAAGAG GCCTCACATA 
CCAGCTCACG GGAGAATGGA GTACTGTCTC AGTTTCTAAC CACTCTGTGC AAGTAAGCAT 
CTTACAACTG GCTCTTCCTC CCCTCACTAA GAAGACCCAA ACCTCTGCAT AATGGGATTT 
CGCCTTTGCT ACAAGAACTG TCACCCCCAA CCCTGATAAA AGAGATGGAA GGAAAAAAAA 
AAAAAAA 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
15 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 
65 70 75 80 

Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin lie Leu Met lie Arg Tyr Pro Ser Ser Gin Leu Gly 
100 105 110 

Glu Met Ser Leu Glu Glu His Ser Gin Cys Glu Cys Arg Pro Lys Lys 
115 120 125 



Lys Asp Ser Ala Val Lys Pro Asp Arg Cys Arg Lys Leu Arg Arg 
130 135 140 



WO 96/27007 



PCT/AU96/00094 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 3.. 305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



CC ATG AGC CCT CTG CTC CGC CGC CTG CTG CTC GCC GCA CTC CTG CAG 
Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin 



CTG GCC CCC GCC CAG GCC CCT GTC TCC CAG CCT GAT GCC CCT GGC CAC 
Leu Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His 



CAG AGG AAA GTG GTG TCA TGG ATA GAT GTG TAT ACT CGC GCT ACC TGC 
Gin Arg Lys Val Val Ser Trp He Asp Val Tyr Thr Arg Ala Thr Cys 



CAG CCC CGG GAG GTG GTG GTG CCC TTG ACT GTG GAG CTC ATG GGC ACC 
Gin Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr 



GTG GCC AAA CAG CTG GTG CCC AGC TGC GTG ACT GTG CAG CGC TGT GGT 
Val Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly 



GGC TGC TGC CCT GAC GAT GGC CTG GAG TGT GTG CCC ACT GGG CAG CAC 
Gly Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His 



CAA GTC CGG ATG CAG ACC TAAAAAAAAG GACAGTGCTG TGAAGCCAGA 335 
Gin Val Arg Met Gin Thr 
100 

CAGGGCTGCC ACTCCCCACC ACCGTCCCCA GCCCCGTTCT GTTCCGGGCT GGGACTCTGC 395 

CCCCGGAGCA CCCTCCCCAG CTGACATCAC CCATCCCACT CCAGCCCCAG GCCCCTCTGC 4 55 

CCACGCTGCA CCCAGCACCA CCAGCGCCCT GACCCCCGGA CCTGCCGCTG CCGCTGCCGA 515 

CGCCGCAGCT TCCTCCGTTG CCAAGGGCGG GGCTTAGAGC TCAACCCAGA CACCTGCAGG 575 

TGCCGGAAGC TGCGAAGGTG ACACATGGCT TTTCAGACTC AGCAGGGTGA CTTGCCTCAG 63 5 

AGGCTATATC C CAGTGGGG A ACAAAGAGGA GCCTGGTAAA AAACAGCCAA GCCCCCAAGA 6 95 
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CCTCAGCCCA GGCAGAAGCT GCTCTAGGAC CTGGGCCTCT CAGAGGGCTC TTCTGCCATC 75 5 
CCTTGTCTCC CTGAGGCCAT CATCAAACAG GACAGAGTTG GAAGAGGAGA CTGGGAGGCA 815 
GCAAGAGGGG TCACATACCA GCTCAGGGGA GAATGGAGTA CTGTCTCAGT TTCTAACCAC 87 5 
TCTGTGCAAG TAAGCATCTT ACAACTGGCT CTTCC 910 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gin Leu 
15 10 15 

Ala Pro Ala Gin Ala Pro Val Ser Gin Pro Asp Ala Pro Gly His Gin 
20 25 30 

Arg Lys Val Val Ser Trp lie Asp Val Tyr Thr Arg Ala Thr Cys Gin 
35 40 45 

Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 
50 55 60 

Ala Lys Gin Leu Val Pro Ser Cys Val Thr Val Gin Arg Cys Gly Gly 



Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gin His Gin 
85 90 95 

Val Arg Met Gin Thr 
100 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ACCACCACCT CCCTGGGCTG GCATGTGGCA CGTGCATAAA CG 42 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



AGTTGTTTGA CCACATTGCC CATGAGTTCC ATGCTCAGAG GC 42 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



GATCCTGGGG CTGGAGTGGG ATGGATGATG TCAGCTGG 



38 
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(2) INFORMATION rUR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Oligonucleotide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GCGGGCAGAG GATCCTGGGG CTGTCTGGCC TCACAGCACT 



40 



